Cogset: a high performance MapReduce engine
نویسندگان
چکیده
منابع مشابه
Cogset: a high performance MapReduce engine
MapReduce has become a widely employed programming model for large-scale data-intensive computations. Traditional MapReduce engines employ dynamic routing of data as a core mechanism for fault tolerance and load balancing. An alternative mechanism is static routing, which reduces the need to store temporary copies of intermediate data, but requires a tighter coupling between the components for ...
متن کاملXSet: A High Performance XML Search Engine
Internet-scale distributed applications (such as widearea service and device discovery and location, user preference management, Domain Name Service) impose interesting requirements on information storage, management, and retrieval. They maintain structured soft-state and pose numerous complex queries against that state. These application are typically implemented using traditional databases, w...
متن کاملNetVCR: A High-Performance Packet Replay Engine
This paper describes the design and implementation of NetVCR, a high-performance packet replay engine for use in evaluating the performance of networking devices. NetVCR is implemented on commodity hardware using widely available open-source software. To achieve high throughput and accuracy, NetVCR employs novel mechanisms for managing trace files and accurate low-overhead timers. In addition, ...
متن کاملMapReduce for the Cell Broadband Engine Architecture
In this paper, we propose the evaluation of MapReduce on the Cell processor by way of the Marchine Cubes application. We argue that the Cell architecture and the MapReduce parallel programming model complement each other well, and that the Marching Cubes application is a good application through which to evaluate this potential synergy. For the interested reader, a preliminary design and plan o...
متن کاملWebPIE: A Web-scale parallel inference engine using MapReduce
The large amount of Semantic Web data and its fast growth pose a significant computational challenge in performing efficient and scalable reasoning. On a large scale, the resources of single machines are no longer sufficient and we are required to distribute the process to improve performance. In this article, we propose a distributed technique to perform materialization under the RDFS and OWL ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Concurrency and Computation: Practice and Experience
سال: 2012
ISSN: 1532-0626
DOI: 10.1002/cpe.2827